Search CORE

7 research outputs found

A framework for bibliographic recommendation system based on Heterogeneous Retrieval Model?

Author: Anthony Poonam
Bhowmick Plaban Kumar
Publication venue: 'University of Waikato'
Publication date: 01/01/2018
Field of study

In this paper, we propose an architectural framework for recommending heterogeneous resources in a digital library.We present an outline of our proposed recommendation framework, and discuss brie its performance over SpringerNature SciGraph¹ dataset

Research Commons@Waikato

Segmenting Scientific Abstracts into Discourse Categories: A Deep Learning-Based Approach for Sparse Labeled Data

Author: Banerjee Soumya
Bhowmick Plaban Kumar
Chattopadhyay Samiran
Das Parthapratim
Sanyal Debarshi Kumar
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 27/05/2020
Field of study

The abstract of a scientific paper distills the contents of the paper into a short paragraph. In the biomedical literature, it is customary to structure an abstract into discourse categories like BACKGROUND, OBJECTIVE, METHOD, RESULT, and CONCLUSION, but this segmentation is uncommon in other fields like computer science. Explicit categories could be helpful for more granular, that is, discourse-level search and recommendation. The sparsity of labeled data makes it challenging to construct supervised machine learning solutions for automatic discourse-level segmentation of abstracts in non-bio domains. In this paper, we address this problem using transfer learning. In particular, we define three discourse categories BACKGROUND, TECHNIQUE, OBSERVATION-for an abstract because these three categories are the most common. We train a deep neural network on structured abstracts from PubMed, then fine-tune it on a small hand-labeled corpus of computer science papers. We observe an accuracy of 75% on the test corpus. We perform an ablation study to highlight the roles of the different parts of the model. Our method appears to be a promising solution to the automatic segmentation of abstracts, where the labeled data is sparse.Comment: to appear in the proceedings of JCDL'202

arXiv.org e-Print Archive

Crossref

Generation of Highlights from Research Papers Using Pointer-Generator Networks and SciBERT Embeddings

Author: Bhowmick Plaban Kumar
Chattopadhyay Samiran
Das Partha Pratim
Rehman Tohida
Sanyal Debarshi Kumar
Publication venue
Publication date: 01/01/2023
Field of study

Nowadays many research articles are prefaced with research highlights to summarize the main findings of the paper. Highlights not only help researchers precisely and quickly identify the contributions of a paper, they also enhance the discoverability of the article via search engines. We aim to automatically construct research highlights given certain segments of the research paper. We use a pointer-generator network with coverage mechanism and a contextual embedding layer at the input that encodes the input tokens into SciBERT embeddings. We test our model on a benchmark dataset, CSPubSum and also present MixSub, a new multi-disciplinary corpus of papers for automatic research highlight generation. For both CSPubSum and MixSub, we have observed that the proposed model achieves the best performance compared to related variants and other models proposed in the literature. On the CSPubSum data set, our model achieves the best performance when the input is only the abstract of a paper as opposed to other segments of the paper. It produces ROUGE-1, ROUGE-2 and ROUGE-L F1-scores of 38.26, 14.26 and 35.51, respectively, METEOR F1-score of 32.62, and BERTScore F1 of 86.65 which outperform all other baselines. On the new MixSub data set, where only the abstract is the input, our proposed model (when trained on the whole training corpus without distinguishing between the subject categories) achieves ROUGE-1, ROUGE-2 and ROUGE-L F1-scores of 31.78, 9.76 and 29.3, respectively, METEOR F1-score of 24.00, and BERTScore F1 of 85.25, outperforming other models.Comment: 18 pages, 7 figures, 7 table

arXiv.org e-Print Archive

Directory of Open Access Journals

Reader Perspective Emotion Analysis in Text through Ensemble based Multi-Label Classification Framework

Author: Plaban Kumar Bhowmick
Publication venue: 'Canadian Center of Science and Education'
Publication date: 01/01/2014
Field of study

Crossref

A review of author name disambiguation techniques for the PubMed bibliographic database

Author: Blei DM
Chen T
Debarshi Kumar Sanyal
Ester M
Fan X
Humphrey SM
Khabsa M
Kim K
Partha Pratim Das
Pereira DA
Pflugrad A
Plaban Kumar Bhowmick
Schilder I
Strotmann A
Sun Y
Tang J
Torvik VI
Treeratpituk P
Yin X
Zadrozny B
Zhang Q
Zhang T
Zhang Y
Zhao D
Zhao Z
Publication venue: 'SAGE Publications'
Publication date
Field of study

Crossref

A Tutorial on Multilabel Learning

Author: Antenreiter Martin
Bhowmick Plaban K.
Blockeel Hendrik
Briggs Forrest
Brinker Klaus
Brinker Klaus
Ciarelli Patrick Marques
Clare Amanda
Crammer Koby
de Carvalho André
Elisseeff Andre
Eva Gibaja
Jiang Aiwen
Katakis Ioannis
Kawai Kentaro
Kumar Neeraj
Lafferty J.
LAWS.
Lewis David D.
Loza Eneldo
Loza Eneldo
McCallum Andrew Kachites
MLD.
MLD.
NIPS.
Pestian J. P.
Read Jesse
Read Jesse
Sebastián Ventura
Sechidis Konstantinos
Shao Huan
Skabar Andrew
Spat Stephan
Tai Farbound
Tenenboim Lena
Trohidis K.
Tsoumakas Grigorios
Tsoumakas Grigorios
Yang Yiming
Zhang Min-Ling
Ávila Jose L.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date
Field of study

Crossref